WHAT IS CLAIMED: 

1 . A test set for characterizing substrate specificities of kinases comprising at 
least two peptide pools, wherein substantially every peptide in each of the 
peptide pools comprises one phosphorylatable amino acid position, one 
query amino acid position, at least one anchor amino acid position, and at 
least one degenerate amino acid position, and wherein: 

each peptide of every peptide pool has an identical phosphorylatable 
amino acid that can be phosphorylated by a kinase at the phosphorylatable 
amino acid position; 

the query amino acid position is at a defined position relative to the 
phosphorylatable amino acid position within every peptide of every peptide 
pool but a query amino acid's identity at the query amino acid position is 
systematically varied from one peptide pool to the next peptide pool within 
the test set of peptide pools; 

each anchor amino acid position is at a defined position relative to 
the phosphorylatable amino acid position within every peptide of every 
peptide pool and each anchor amino acid position has an identical anchor 
amino acid at that anchor amino acid position within every peptide of every 
peptide pool; 

each degenerate amino acid position within every peptide of every 
peptide pool is occupied by an amino acid from a defined mixture of amino 
acids; and 

the query amino acid position is not adjacent to an anchor amino acid 
position or the query amino acid position is not adjacent to the 
phosphorylatable amino acid position in any peptide pool of the test set. 

2. The test set of claim 1 , wherein at least one anchor amino acid is arginine. 

3. The test set of claim 1, wherein at least one anchor amino acid is proline. 
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4. 



The test set of claim 1, wherein at least one anchor amino acid is 
phenylalanine. 



5. The test set of claim 1, wherein an anchor amino acid position is located one 
5 position C-terminal to the phosphorylatable amino acid position. 

6. The test set of claim 5, wherein proline is the anchor amino acid at the 
anchor amino acid position located one position C-terminal to the 
phosphorylatable amino acid position. 

10 

7. The test set of claim 5, wherein glutamine is the anchor amino acid at the 
anchor amino acid position located one position C-terminal to the 
phosphorylatable amino acid position. 

1 5 8. The test set of claim 5, wherein arginine is the anchor amino acid at the 
anchor amino acid position located one position C-terminal to the 
phosphorylatable amino acid position. 

9. The test set of claim 5, wherein phenylalanine is the anchor amino acid at 
20 the anchor amino acid position located one position C-terminal to the 

phosphorylatable amino acid position. 

1 0. The test set of claim 1 , wherein an anchor amino acid position is located 
three positions N-terminal to the phosphorylatable amino acid position. 

25 

1 1 . The test set of claim 10, wherein arginine is the anchor amino acid at the 
anchor amino acid position located three positions N-terminal to the 
phosphorylatable amino acid position. 

30 12. The test set of claim 1 , wherein every peptide in each of the peptide pools 
comprises less than four anchor amino acids. 
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1 3 . The test set of claim 1 , wherein at least one degenerate position in each 
peptide pool in the test set is occupied by a defined mixture of more than 
five amino acids. 

5 

14. The test set of claim 13, wherein the defined mixture comprises all natural 
amino acids. 

15. The test set of claim 13, wherein the defined mixture comprises all natural 
10 amino acids except cysteine. 

1 6. The test set of claim 13, wherein each amino acid's relative abundance in the 
defined mixture is approximately that amino acid's human proteome relative 
abundance. 

15 

1 7. The test set of claim 13, wherein the defined mixture of amino acids 
comprises proline. 

18. The test set of claim 13, wherein the defined mixture of amino acids 
20 comprises arginine. 

1 9. The test set of claim 1 , wherein the test set has at least four peptide pools 
and each of the four peptide pools have a different query amino acid. 

25 20. The test set of claim 1 , wherein the query amino acid position is two 
positions N-terminal to the phosphorylatable amino acid position. 

2 1 . The test set of claim 1 , wherein the query amino acid position is two 
positions C-terminal to the phosphorylatable amino acid position. 

30 

22. The test set of claim 1 , wherein one query amino acid is proline. 
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23 . The test set of claim 1 , wherein one query amino acid is arginine. 

24. The test set of claim 1 , wherein each peptide pool is a soluble mixture of 
5 peptides. 

25. The test set of claim 24, wherein substantially every peptide is linked to 
biotin. 

1 0 26. The test set of claim 1 , wherein substantially every peptide of every peptide 
pool is attached to a solid support. 

27. A test set for characterizing substrate specificities of kinases comprising at 
least two peptide pools, wherein substantially every peptide in each of the 
1 5 peptide pools comprises one phosphorylatable amino acid position, one 

query amino acid position, and at least one degenerate amino acid position, 
and wherein: 

each peptide of every peptide pool has an identical phosphorylatable 
amino acid that can be phosphorylated by a kinase at the phosphorylatable 
20 amino acid position; 

the query amino acid position is at a defined position relative to the 
phosphorylatable amino acid position within every peptide of every peptide 
pool but a query amino acid's identity at the query amino acid position is 
systematically varied from one peptide pool to the next peptide pool within 
25 the test set of peptide pools; 

each degenerate amino acid position within every peptide of every 
peptide pool is occupied by an amino acid from a defined mixture of amino 
acids; 

the query amino acid position is not adjacent to the phosphorylatable 
30 amino acid position in any peptide pool of the test set. 
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A test set for characterizing substrate specificities of kinases comprising at 
least two peptide pools, wherein every peptide in each of the peptide pools 
comprises one phosphorylatable amino acid position, one query amino acid, 
at least one anchor amino acid position, and at least one degenerate amino 
acid position, and wherein: 

each peptide of every peptide pool has an identical phosphorylatable 
amino acid that can be phosphorylated by a kinase at the phosphorylatable 
amino acid position; 

every peptide of every peptide pool has an identical query amino acid 
but the position of the query amino acid relative to the phosphorylatable 
amino acid position is systematically varied from one peptide pool to the 
next peptide pool within the test set of peptide pools; 

each anchor amino acid position is at a defined position relative to 
the phosphorylatable amino acid position within every peptide of every 
peptide pool and each anchor amino acid position has an identical anchor 
amino acid at that anchor amino acid position within every peptide of every 
peptide pool; 

each degenerate amino acid position within every peptide of every 
peptide pool is occupied by an amino acid from a defined mixture of amino 
acids. 

The test set of claim 28, wherein there are at least three peptide pools. 

The test set of claim 28, wherein there the query amino acid is arginine. 

A binding entity whose binding differentiates between a defined peptide 
having any one of SEQ ID NO: 76, 79, 81, 82, 87, 89-94, 97, 98, 100, 102, 
104, 105, 108, 110, 112, 113, 115, 117, 121, 124, 125, 127-134, 136, 138, 
139, 143-145, 148-153, 156, 160, 163-180, 182-194, 196-206, 208-211,213- 
216 and the corresponding defined peptide after phosphorylation by PKC- 
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theta, and wherein the binding entity has substantially no binding to a 
phosphorylated peptide having SEQ ID NO: 229 (WKN-pS-IRH). 



32. The binding entity of claim 3 1 , wherein the binding entity binds less 
5 efficiently to the defined peptide than to the defined peptide after 

phosphorylation by PKC-theta. 

3 3 . The binding entity of claim 32, wherein the binding entity also has 

substantially no binding to a phosphorylated peptide having SEQ ID NO 230 
10 (RRP-pS-YRK). 

34. The binding entity of claim 32, wherein the defined peptide comprises SEQ 
ID NO:76 (HVRRRRGTFKRSKLRARD). 

15 35. The binding entity of claim 32, wherein the defined peptide comprises SEQ 
ID NO:121 (LRRRSLRRSNSISKSPGP). 

36. The binding entity of claim 32, wherein the defined peptide comprises SEQ 
ID NO:209 (DKEKSKGSLKRK). 

20 

37. The binding entity of claim 3 1 , wherein the binding entity is a polypeptide or 
a mixture of polypeptides sharing a similar binding specificity. 

3 8 . The binding entity of claim 3 1 , wherein the binding entity is an antibody, an 
25 antibody fragment or a mixture thereof. 

39. The binding entity of claim 3 1 , wherein the binding entity binds more 
efficiently to the defined peptide than to the defined peptide after 
phosphorylation by PKC-theta. 
30 40. The binding entity of claim 3 1 , wherein the defined peptide is part of a 
protein. 
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41 . A binding entity whose binding differentiates between a defined 
phosphorylated peptide having any one of SEQ ID NO.298-347, 349-473 
and a non-phosphorylated peptide that differs from the defined peptide by 
substitution of Ser for the pSer or substitution of a Thr for the pThr, and 
wherein the binding entity has substantially no binding to a phosphorylated 
peptide having SEQ ID NO: 229 (WKN-pS-IRH). 

42. The binding entity of claim 40, wherein the binding entity binds more 
efficiently to the defined phosphorylated peptide than to the defined non- 
phosphorylated peptide. 

43. The binding entity of claim 41 , wherein the defined phosphorylated peptide 
comprises SEQ ID NO:298. 

44. The binding entity of claim 4 1 , wherein the defined phosphorylated peptide 
comprises SEQ ID NO:299 or 300. 

45. The binding entity of claim 41 , wherein the defined phosphorylated peptide 
comprises SEQ ID NO:313 or 314. 

46. The binding entity of claim 41 , wherein the defined phosphorylated peptide 
comprises SEQ ID NO:361 or 362. 



25 47. The binding entity of claim 41 , wherein the defined phosphorylated peptide 
comprises any one of SEQ ID NO:301-310. 

48. The binding entity of claim 41 , wherein the defined phosphorylated peptide 
comprises any one of SEQ ID NO:31 1-320. 

30 
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49. The binding entity of claim 41, wherein the defined phosphorylated peptide 
comprises any one of SEQ ID NO.321-330. 

50. The binding entity of claim 4 1 , wherein the defined phosphorylated peptide 
5 comprises any one of SEQ ID NO:33 1-342. 

5 1 . The binding entity of claim 4 1 , wherein the defined phosphorylated peptide 
comprises any one of SEQ ID NO:343-347, 349-362. 

10 52. The binding entity of claim 4 1 , wherein the defined phosphorylated peptide 
comprises any one of SEQ ID NO:363-382. 

53 . The binding entity of claim 4 1 , wherein the defined phosphorylated peptide 
comprises any one of SEQ ID NO:383-473. 

15 

54. The binding entity of claim 40, wherein the binding entity binds less 
efficiently to the defined phosphorylated peptide than to the defined non- 
phosphorylated peptide. 

20 55. The binding entity of claim 40, wherein the defined phosphorylated peptide 
is part of a protein. 

56. A method for characterizing substrate specificities of kinases comprising: 
contacting each peptide pool in at least two test sets of peptide pools 
25 with ATP and a kinase; 

quantifying the amount of phosphorylation in each peptide pool; and 
comparing the amount of phosphorylation in each peptide pool with 
the amount of phosphorylation in at least one other peptide pool; 

wherein substantially every peptide in each of the peptide pools 
30 comprises one phosphorylatable amino acid position, one query amino acid 
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position, at least one anchor amino acid position, and at least one degenerate 
amino acid position, and wherein: 

each peptide of every peptide pool has an identical phosphorylatable 
amino acid that can be phosphorylated by a kinase at the phosphorylatable 
amino acid position; 

the query amino acid position is at a defined position relative to the 
phosphorylatable amino acid position within every peptide of every peptide 
pool but a query amino acid's identity at the query amino acid position is 
systematically varied from one peptide pool to the next peptide pool within 
the test set of peptide pools; 

each anchor amino acid position is at a defined position relative to 
the phosphorylatable amino acid position within every peptide of every 
peptide pool and each anchor amino acid position has an identical anchor 
amino acid at that anchor amino acid position within every peptide of every 
peptide pool; and 

each degenerate amino acid position within every peptide of every 
peptide pool is occupied by an amino acid from a defined mixture of amino 
acids 

The method of claim 56, wherein quantifying the amount of phosphorylation 
comprises determining a total amount of labeled phosphate incorporated into 
each peptide pool. 

The method of claim 56, wherein quantifying the amount of phosphorylation 
comprises determining a total amount of phosphorylated peptide in each 
peptide pool with an antibody specific for a phosphorylated peptide. 

The method of claim 56, wherein the method further comprises placing a 
value for each amount of phosphorylation into a matrix relating amino acid 
position and amino acid identity with the amount of phosphorylation. 
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The method of claim 56, wherein the matrix is used to predict preferred 
substrate peptide sequences for the kinase. 

A computer readable medium comprising computer-executable instructions, 
wherein the computer-executable instructions comprise conversion of input 
data into quantitative values specifying a preference value for each of a 
plurality of amino acids at each defined position in a substrate peptide for a 
kinase, wherein: 

the input data comprises sequence and phosphorylation data for a test 
set of peptides comprising at least two peptide pools, wherein every peptide 
in each of the peptide pools comprises one phosphorylatable amino acid 
position, and one query amino acid position, wherein: 

each peptide of every peptide pool has an identical phosphorylatable 
amino acid that can be phosphorylated by a kinase at the phosphorylatable 
amino acid position; 

the query amino acid position is at the defined position relative to the 
phosphorylatable amino acid position within every peptide of every peptide 
pool but a query amino acid's identity at the query amino acid position is 
systematically varied from one peptide pool to the next peptide pool within 
the test set of peptide pools; 

a preference value for a particular amino acid at the defined position 
is substantially determined from the amount of phosphorylation of the 
peptide pool wherein that particular amino acid is the query residue and the 
query position is located at the defined position. 

The computer readable medium of claim 61, wherein a ratio between (the 
preference value for one amino acid) and (the preference value for a second 
amino acid) is generally proportional to a ratio between (the amount of 
phosphorylation of the peptide pool in which the first amino acid is the 
query amino acid) and (the amount of phosphorylation of the peptide pool in 
which the second amino acid is the query amino acid). 
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63 . The computer readable medium of claim 6 1 , wherein the difference between 
(the preference value for one amino acid) and (the preference value for a 
second amino acid) is generally proportional to a logarithmic transformation 
5 of the ratio between (the amount of phosphorylation of the peptide pool in 

which the first amino acid is the query amino acid) and (the amount of 
phosphorylation of the peptide pool in which the second amino acid is the 
query amino acid). 

1 0 64. The computer readable medium of claim 61 , wherein the instructions further 
comprise inputting one or more peptide sequences and predicting a 
likelihood of phosphorylation of the one or more peptide sequences of said 
kinase. 
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A method for visual display of amino acid or nucleotide sequence 
preferences comprising a series of stacks of single letter symbols for amino 
acids or nucleotides, wherein 

each stack represents a position in a peptide or a nucleic acid 
sequence; 

each symbol's height is proportional to the absolute value of a 
quantitative parameter that is positive for favored amino acids or nucleotides 
and negative for disfavored amino acids or nucleotides; 

each symbol's position within the stack is sorted from bottom to top 
in ascending value by the quantitative parameter. 



66. A computer readable medium having computer-executable instructions for 
performing a method of visually displaying amino acid or nucleotide 
sequence preferences, the method comprising: 

representing a position in a peptide or a nucleic acid sequence with a 
30 stack of single letter symbols for amino acids or nucleotides; and 
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displaying a 'linear array of one or more stacks of letter symbols 
wherein each letter symbol's height is proportional to the absolute value of a 
quantitative parameter that is positive for favored amino acids or nucleotides 
and negative for disfavored amino acids or nucleotides and wherein each 
letter symbol's position within the stack is sorted from bottom to top in 
ascending order by the value of the quantitative parameter. 

A computer readable medium having computer-executable instructions of 
claim 66, wherein the symbols are single letter codes for amino acids. 

A computer readable medium having computer-executable instructions of 
claim 66, wherein the sequence preferences relate to kinase specificity. 
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